A Text Mining Approach for Definition Question Answering

Authors

  • Claudia Denicia-Carral
  • Manuel Montes-y-Gómez
  • Luis Villaseñor Pineda
  • René Hernández
Abstract

This paper describes a method for definition question answering based on surface text patterns. The method is especially suited to answering questions about a person's position or an acronym's description. It consists of two main steps. First, it applies a sequence-mining algorithm to discover a set of definition-related text patterns from the Web. Then, using these patterns, it extracts a collection of concept-description pairs from a target document database and applies the sequence-mining algorithm to determine the most adequate answer to a given question. Experimental results on the Spanish CLEF 2005 data set indicate that this method can be a practical solution for answering this kind of definition question, reaching a precision as high as 84%.
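The second step described in the abstract (pattern-based extraction of concept-description pairs, followed by answer selection) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two regular expressions are hand-written stand-ins for the patterns that the paper mines from the Web, the names `PATTERNS`, `extract_pairs`, and `answer` are hypothetical, and a simple frequency vote replaces the paper's sequence-mining step for choosing the answer.

```python
import re
from collections import Counter

# Hand-written stand-ins for mined surface patterns (assumption: the paper
# discovers its patterns automatically; these two are illustrative only).
PATTERNS = [
    # "the <description>, <Concept>," e.g. "the president of Mexico, Vicente Fox,"
    re.compile(
        r"\b[Tt]he (?P<description>[^,.]+), "
        r"(?P<concept>[A-Z][a-z]+(?: [A-Z][a-z]+)+),"
    ),
    # "<ACRONYM> (<description>)" e.g. "UN (United Nations)"
    re.compile(r"\b(?P<concept>[A-Z]{2,})\s\((?P<description>[^()]+)\)"),
]

# Toy document collection (invented sentences, not data from the paper).
SAMPLE_DOCS = [
    "Yesterday the president of Mexico, Vicente Fox, said the talks went well.",
    "In a speech, the president of Mexico, Vicente Fox, announced a plan.",
    "Critics of the former executive, Vicente Fox, disagreed.",
    "The UN (United Nations) met on Monday.",
]


def extract_pairs(documents):
    """Collect concept -> Counter of candidate descriptions."""
    pairs = {}
    for text in documents:
        for pattern in PATTERNS:
            for match in pattern.finditer(text):
                concept = match.group("concept")
                description = match.group("description").strip()
                pairs.setdefault(concept, Counter())[description] += 1
    return pairs


def answer(concept, pairs):
    """Return the most frequent description for a concept, or None.

    Simplification: a frequency vote stands in for the sequence-mining
    step that the paper uses to pick the most adequate answer.
    """
    counts = pairs.get(concept)
    if not counts:
        return None
    return counts.most_common(1)[0][0]
```

Calling `answer("Vicente Fox", extract_pairs(SAMPLE_DOCS))` selects the description that the toy collection supports most often. A real system would need far more robust patterns and a target collection such as the CLEF documents.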

Similar resources

Using Machine Learning and Text Mining in Question Answering

This paper describes a QA system centered in a full data-driven architecture. It applies machine learning and text mining techniques to identify the most probable answers to factoid and definition questions respectively. Its major quality is that it mainly relies on the use of lexical information and avoids applying any complex language processing resources such as named entity classifiers, par...

INAOE at CLEF 2006: Experiments in Spanish Question Answering

This paper describes the system developed by the Language Technologies Lab at INAOE for the Spanish Question Answering task at CLEF 2006. The presented system is centered in a full data-driven architecture that uses machine learning and text mining techniques to identify the most probable answers to factoid and definition questions respectively. Its major quality is that it mainly relies on the ...

Text Mining in Biograph

The Biograph project is a biomedical knowledge discovery project combining graph data mining with structured biomedical information and with text mining on medline abstracts. It is a cooperation between the molecular genetics, data mining, and computational linguistics research groups of the University of Antwerp. In this talk, I will outline the general architecture of the system, which is cur...

Semantic Content Access Using Domain-Independent NLP Ontologies

We present a lightweight, user-centred approach for document navigation and analysis that is based on an ontology of text mining results. This allows us to bring the result of existing text mining pipelines directly to end users. Our approach is domain-independent and relies on existing NLP analysis tasks such as automatic multi-document summarization, clustering, question-answering, and opinio...

Mining Paraphrasal Typed Templates from a Plain Text Corpus

Finding paraphrases in text is an important task with implications for generation, summarization and question answering, among other applications. Of particular interest to those applications is the specific formulation of the task where the paraphrases are templated, which provides an easy way to lexicalize one message in multiple ways by simply plugging in the relevant entities. Previous work...

Publication date: 2006